ViSQOL: an objective speech quality model

Authors

  • Andrew Hines
  • Jan Skoglund
  • Anil C. Kokaram
  • Naomi Harte
Abstract

This paper presents an objective speech quality model, ViSQOL, the Virtual Speech Quality Objective Listener. It is a signal-based, full-reference, intrusive metric that models human speech quality perception using a spectro-temporal measure of similarity between a reference and a test speech signal. The metric was specifically designed to be robust to quality issues associated with Voice over IP (VoIP) transmission. This paper describes the algorithm and compares its quality predictions with those of the ITU-T standard metrics PESQ and POLQA for common problems in VoIP: clock drift, associated time warping, and playout delays. The results indicate that ViSQOL and POLQA significantly outperform PESQ, with ViSQOL competing well with POLQA. Extensive benchmarking against PESQ, POLQA, and simpler distance metrics using three speech corpora (NOIZEUS, E4, and the ITU-T P.Sup. 23 database) is also presented. These experiments benchmark performance across a wide range of quality impairments, including VoIP degradations, a variety of background noise types, speech enhancement methods, and SNR levels. The results and subsequent analysis show that both ViSQOL and POLQA have some performance weaknesses and under-predict perceived quality in certain VoIP conditions. Both are applicable to a wider range of conditions, and more robust across them, than PESQ or more trivial distance metrics. ViSQOL is shown to offer a useful alternative to POLQA in predicting speech quality in...
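To make the idea of a full-reference, spectro-temporal similarity measure concrete, here is a minimal Python sketch. It is not the ViSQOL algorithm: the abstract does not give the model's details, so this toy simply correlates the log-magnitude spectrograms of a reference and a degraded signal. The function names, frame length, and hop size are illustrative assumptions, and the two signals are assumed to be time-aligned and of equal length.

```python
import numpy as np


def log_spectrogram(signal, frame_len=256, hop=128):
    """Log-magnitude STFT spectrogram with shape (frames, frequency bins).

    frame_len and hop are arbitrary illustrative values (assumption).
    """
    signal = np.asarray(signal, dtype=float)
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Small floor avoids log(0) in silent bins.
    return np.log10(magnitude + 1e-6)


def spectrogram_similarity(reference, degraded):
    """Pearson correlation between two log spectrograms, in [-1, 1].

    A toy stand-in for a spectro-temporal similarity measure; it assumes
    both signals are already aligned and have the same length.
    """
    ref = log_spectrogram(reference)
    deg = log_spectrogram(degraded)
    ref_c = ref - ref.mean()
    deg_c = deg - deg.mean()
    denom = np.sqrt((ref_c ** 2).sum() * (deg_c ** 2).sum())
    return float((ref_c * deg_c).sum() / denom)


# Usage: a clean tone compared with itself and with a noisy copy.
t = np.arange(8000) / 8000.0
clean = np.sin(2 * np.pi * 440.0 * t)
noisy = clean + 0.5 * np.random.default_rng(0).standard_normal(clean.shape)
score_clean = spectrogram_similarity(clean, clean)
score_noisy = spectrogram_similarity(clean, noisy)
```

A signal compared with itself scores essentially 1, and added noise lowers the score. Because this sketch assumes aligned signals, it would not cope with the clock drift, time warping, and playout delays that the abstract highlights as the hard cases for VoIP.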

Similar resources

ViSQOL: The Virtual Speech Quality Objective Listener

A model of human speech quality perception has been developed to provide an objective measure for predicting subjective quality assessments. The Virtual Speech Quality Objective Listener (ViSQOL) model is a signal based full reference metric that uses a spectro-temporal measure of similarity between a reference and a test speech signal. This paper describes the algorithm and compares the result...

Measuring and monitoring speech quality for voice over IP with POLQA, viSQOL and p.563

There are many types of degradation which can occur in Voice over IP (VoIP) calls. Of interest in this work are degradations which occur independently of the codec, hardware or network in use. Specifically, their effect on the subjective and objective quality of the speech is examined. Since no dataset suitable for this purpose exists, a new dataset (TCD-VoIP) has been created and has been made...

Vision and Quality of Life: Development of Methods for the VisQoL Vision-Related Utility Instrument

PURPOSE To describe the methods and innovations used in constructing the VisQoL, a vision-related utility instrument for the health economic evaluation of eye care and rehabilitation programs. METHODS The VisQoL disaggregates vision into six items. Utilities were estimated for item worst responses (the worst level for each item, with all other items at their best level) and VisQoL all-worst r...

Improvements in vision‐related quality of life in blind patients implanted with the Argus II Epiretinal Prosthesis

BACKGROUND The purpose of this analysis is to report the change in quality of life (QoL) after treatment with the Argus II Epiretinal Prosthesis in patients with end-stage retinitis pigmentosa. METHODS The Vision and Quality of Life Index (VisQoL) was used to assess changes in QoL dimensions and overall utility score in a prospective 30-patient single-arm clinical study. VisQoL is a multi-att...

Vision and quality of life: the development of a utility measure.

PURPOSE To identify the content for a vision and quality of life-related utility measure (Vision Quality of Life Index [VisQoL]) for the economic evaluation of eye care and rehabilitation programs. METHODS Focus groups of the visually impaired elicited key concepts. Based on these and previous research, 33 items were generated. These were administered to visually impaired adults (n = 70) and ...

Journal:
  • EURASIP J. Audio, Speech and Music Processing

Volume: 2015

Publication date: 2015